2,120 research outputs found

    Top-Down Induction of Decision Trees: Rigorous Guarantees and Inherent Limitations

    Consider the following heuristic for building a decision tree for a function $f : \{0,1\}^n \to \{\pm 1\}$. Place the most influential variable $x_i$ of $f$ at the root, and recurse on the subfunctions $f_{x_i=0}$ and $f_{x_i=1}$ on the left and right subtrees respectively; terminate once the tree is an $\varepsilon$-approximation of $f$. We analyze the quality of this heuristic, obtaining near-matching upper and lower bounds:
    $\circ$ Upper bound: For every $f$ with decision tree size $s$ and every $\varepsilon \in (0,\frac{1}{2})$, this heuristic builds a decision tree of size at most $s^{O(\log(s/\varepsilon)\log(1/\varepsilon))}$.
    $\circ$ Lower bound: For every $\varepsilon \in (0,\frac{1}{2})$ and $s \le 2^{\tilde{O}(\sqrt{n})}$, there is an $f$ with decision tree size $s$ such that this heuristic builds a decision tree of size $s^{\tilde{\Omega}(\log s)}$.
    We also obtain upper and lower bounds for monotone functions: $s^{O(\sqrt{\log s}/\varepsilon)}$ and $s^{\tilde{\Omega}(\sqrt[4]{\log s})}$ respectively. The lower bound disproves conjectures of Fiat and Pechyony (2004) and Lee (2009). Our upper bounds yield new algorithms for properly learning decision trees under the uniform distribution. We show that these algorithms---which are motivated by widely employed and empirically successful top-down decision tree learning heuristics such as ID3, C4.5, and CART---achieve provable guarantees that compare favorably with those of the current fastest algorithm (Ehrenfeucht and Haussler, 1989). Our lower bounds shed new light on the limitations of these heuristics. Finally, we revisit the classic work of Ehrenfeucht and Haussler, extending it to give the first uniform-distribution proper learning algorithm that achieves polynomial sample and memory complexity while matching its state-of-the-art quasipolynomial runtime.
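    For intuition, here is a minimal Python sketch of the greedy heuristic described in this abstract, under some simplifying assumptions: variable influences and approximation errors are computed exactly from the full truth table (so it is only feasible for small n), and each branch stops once a constant label is an $\varepsilon$-approximation of its restricted subfunction, a per-branch simplification of the global stopping rule stated above. It is an illustration rather than the authors' implementation; practical heuristics such as ID3, C4.5, and CART instead use sample-based impurity criteria.

```python
import itertools
from typing import Callable, List, Optional, Tuple

BoolFn = Callable[[Tuple[int, ...]], int]

def influence(f: BoolFn, n: int, i: int) -> float:
    """Inf_i(f): fraction of inputs x for which flipping coordinate i changes f(x)."""
    flips = 0
    for x in itertools.product((0, 1), repeat=n):
        y = list(x)
        y[i] ^= 1
        if f(x) != f(tuple(y)):
            flips += 1
    return flips / 2 ** n

def best_constant_error(f: BoolFn, n: int) -> float:
    """Error of the best constant (+1 or -1) approximation of f under the uniform distribution."""
    ones = sum(1 for x in itertools.product((0, 1), repeat=n) if f(x) == 1)
    return min(ones, 2 ** n - ones) / 2 ** n

def restrict(f: BoolFn, i: int, b: int) -> BoolFn:
    """The subfunction f_{x_i = b}: coordinate i is pinned to b, inputs keep their length."""
    def g(x):
        y = list(x)
        y[i] = b
        return f(tuple(y))
    return g

def top_down(f: BoolFn, n: int, eps: float, free: Optional[List[int]] = None):
    """Greedy top-down induction: split on the most influential free variable and recurse;
    a branch becomes a leaf once a constant label is an eps-approximation of its subfunction."""
    if free is None:
        free = list(range(n))
    if not free or best_constant_error(f, n) <= eps:
        ones = sum(1 for x in itertools.product((0, 1), repeat=n) if f(x) == 1)
        return 1 if ones >= 2 ** (n - 1) else -1             # leaf: majority label
    i = max(free, key=lambda j: influence(f, n, j))           # most influential variable
    rest = [j for j in free if j != i]
    return (i,
            top_down(restrict(f, i, 0), n, eps, rest),        # left subtree:  x_i = 0
            top_down(restrict(f, i, 1), n, eps, rest))        # right subtree: x_i = 1

# Toy target (hypothetical): f(x) = +1 iff x_0 AND x_1, on n = 3 variables.
f = lambda x: 1 if (x[0] and x[1]) else -1
print(top_down(f, n=3, eps=0.05))   # -> (0, -1, (1, -1, 1)), splitting on the relevant variables
```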

    Learning about pain through observation: the role of pain-related fear

    Observational learning may contribute to the development and maintenance of pain-related beliefs and behaviors. The current study examined whether observation of video primes could impact appraisals of potential back-stressing activities, and whether this relationship was moderated by individual differences in pain-related fear. Participants viewed a video prime in which back-stressing activity was associated with pain and injury. Both before and after viewing the prime, participants provided pain and harm ratings of standardized movements drawn from the Photograph of Daily Activities Scale (PHODA). Results indicated that observational learning occurred for participants with high levels of pain-related fear but not for low-fear participants. Specifically, following prime exposure, high-fear participants showed elevated pain appraisals of activity images whereas low-fear participants did not. High-fear participants appraised the PHODA-M images as significantly more harmful regardless of prime exposure. The findings highlight individual moderators of observational learning in the context of pain.

    Agnostic proper learning of monotone functions: beyond the black-box correction barrier

    We give the first agnostic, efficient, proper learning algorithm for monotone Boolean functions. Given $2^{\tilde{O}(\sqrt{n}/\varepsilon)}$ uniformly random examples of an unknown function $f:\{\pm 1\}^n \rightarrow \{\pm 1\}$, our algorithm outputs a hypothesis $g:\{\pm 1\}^n \rightarrow \{\pm 1\}$ that is monotone and $(\mathrm{opt} + \varepsilon)$-close to $f$, where $\mathrm{opt}$ is the distance from $f$ to the closest monotone function. The running time of the algorithm (and consequently the size and evaluation time of the hypothesis) is also $2^{\tilde{O}(\sqrt{n}/\varepsilon)}$, nearly matching the lower bound of Blais et al. (RANDOM '15). We also give an algorithm for estimating, up to additive error $\varepsilon$, the distance of an unknown function $f$ to monotone, using a running time of $2^{\tilde{O}(\sqrt{n}/\varepsilon)}$. Previously, for both of these problems, sample-efficient algorithms were known, but these algorithms were not run-time efficient. Our work thus closes this gap in our knowledge between the run-time and sample complexity. This work builds upon the improper learning algorithm of Bshouty and Tamon (JACM '96) and the proper semiagnostic learning algorithm of Lange, Rubinfeld, and Vasilyan (FOCS '22), which obtains a non-monotone Boolean-valued hypothesis and then ``corrects'' it to monotone using query-efficient local computation algorithms on graphs. This black-box correction approach can achieve no error better than $2\,\mathrm{opt} + \varepsilon$ information-theoretically; we bypass this barrier by (a) augmenting the improper learner with a convex optimization step, and (b) learning and correcting a real-valued function before rounding its values to Boolean. Our real-valued correction algorithm solves the ``poset sorting'' problem of [LRV22] for functions over general posets with non-Boolean labels.

    Learning Stochastic Decision Trees


    Decision Tree Heuristics Can Fail, Even in the Smoothed Setting

    Greedy decision tree learning heuristics are mainstays of machine learning practice, but theoretical justification for their empirical success remains elusive. In fact, it has long been known that there are simple target functions for which they fail badly (Kearns and Mansour, STOC 1996). Recent work of Brutzkus, Daniely, and Malach (COLT 2020) considered the smoothed analysis model as a possible avenue towards resolving this disconnect. Within the smoothed setting and for targets f that are k-juntas, they showed that these heuristics successfully learn f with depth-k decision tree hypotheses. They conjectured that the same guarantee holds more generally for targets that are depth-k decision trees. We provide a counterexample to this conjecture: we construct targets that are depth-k decision trees and show that even in the smoothed setting, these heuristics build trees of depth 2^{Ω(k)} before achieving high accuracy. We also show that the guarantees of Brutzkus et al. cannot extend to the agnostic setting: there are targets that are very close to k-juntas, for which these heuristics build trees of depth 2^{Ω(k)} before achieving high accuracy.

    A Query-Optimal Algorithm for Finding Counterfactuals

    We design an algorithm for finding counterfactuals with strong theoretical guarantees on its performance. For any monotone model $f : X^d \to \{0,1\}$ and instance $x^\star$, our algorithm makes $S(f)^{O(\Delta_f(x^\star))}\cdot \log d$ queries to $f$ and returns an optimal counterfactual for $x^\star$: a nearest instance $x'$ to $x^\star$ for which $f(x')\ne f(x^\star)$. Here $S(f)$ is the sensitivity of $f$, a discrete analogue of the Lipschitz constant, and $\Delta_f(x^\star)$ is the distance from $x^\star$ to its nearest counterfactuals. The previous best known query complexity was $d^{\,O(\Delta_f(x^\star))}$, achievable by brute-force local search. We further prove a lower bound of $S(f)^{\Omega(\Delta_f(x^\star))} + \Omega(\log d)$ on the query complexity of any algorithm, thereby showing that the guarantees of our algorithm are essentially optimal. Comment: 22 pages, ICML 202
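    For reference, a minimal Python sketch of the $d^{\,O(\Delta_f(x^\star))}$ brute-force local search baseline mentioned above, specialized to binary features for simplicity; the paper's query-optimal algorithm is more involved and exploits the monotonicity and sensitivity of $f$. The toy model and instance at the bottom are hypothetical.

```python
from itertools import combinations
from typing import Callable, Optional, Tuple

def brute_force_counterfactual(f: Callable[[Tuple[int, ...]], int],
                               x_star: Tuple[int, ...]) -> Optional[Tuple[int, ...]]:
    """Brute-force local search baseline: enumerate points at Hamming distance
    r = 1, 2, ... from x_star and return the first whose label differs from f(x_star).
    Searching radius by radius guarantees the returned point is a nearest counterfactual;
    the number of queries is d^{O(Delta)} where Delta is that distance."""
    d = len(x_star)
    y_star = f(x_star)
    for r in range(1, d + 1):
        for coords in combinations(range(d), r):   # which coordinates to flip
            x = list(x_star)
            for i in coords:
                x[i] ^= 1                          # flip bit i (binary features assumed)
            x = tuple(x)
            if f(x) != y_star:
                return x
    return None                                    # f is constant, so no counterfactual exists

# Hypothetical toy model: a monotone majority-of-three classifier.
f = lambda x: int(sum(x) >= 2)
print(brute_force_counterfactual(f, (0, 0, 0)))    # a nearest counterfactual, at distance 2
```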

    The Fundamentals Of Bioeconomy The Biobased Society
